Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads
نویسندگان
چکیده
منابع مشابه
Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads
Motivation. The third generation sequencing (3GS) technology generates long sequences of thousands of bases. However, its current error rates are estimated in the range of 15-40%, significantly higher than those of the prevalent next generation sequencing (NGS) technologies (less than 1%). Fundamental bioinformatics tasks such as de novo genome assembly and variant calling require high-quality ...
متن کاملDBG2OLC: Efficient Assembly of Large Genomes Using Long Erroneous Reads of the Third Generation Sequencing Technologies
The highly anticipated transition from next generation sequencing (NGS) to third generation sequencing (3GS) has been difficult primarily due to high error rates and excessive sequencing cost. The high error rates make the assembly of long erroneous reads of large genomes challenging because existing software solutions are often overwhelmed by error correction tasks. Here we report a hybrid ass...
متن کاملA consistency-based consensus algorithm for de novo and reference-guided sequence assembly of short reads
MOTIVATION Novel high-throughput sequencing technologies pose new algorithmic challenges in handling massive amounts of short-read, high-coverage data. A robust and versatile consensus tool is of particular interest for such data since a sound multi-read alignment is a prerequisite for variation analyses, accurate genome assemblies and insert sequencing. RESULTS A multi-read alignment algorit...
متن کاملA permutations-based algorithm for fast alignment of long paired-end reads
random-permutations-based algorithm is presented for alignment of long paired-end reads. An implementation of the algorithm is shown to process a million pairs of 100 − 500bp-long reads in 3-10 minutes on a single CPU, correctly aligning more reads than popular fast aligners, 5-100 times faster. Approved for public release: distribution is unlimited.
متن کاملSensitive Long-Indel-Aware Alignment of Sequencing Reads
The tremdendous advances in high-throughput sequencing technologies have made population-scale sequencing as performed in the 1000 Genomes project [30] and the Genome of the Netherlands project1 possible. In order to keep up with everincreasing sequencing speeds, research on fast read mapping algorithms was indicated and led to substantial advances in the last decade. Refer to [19] for a review...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PeerJ
سال: 2016
ISSN: 2167-8359
DOI: 10.7717/peerj.2016